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Abstract —In this paper, we consider a multi-pair two-way 
amplify-and-forward relaying system where multiple sources ex¬ 
change information via a relay node equipped with large-scale an¬ 
tenna arrays. Given that channel estimation is non-ideal, and that 
the relay employs either maximum-ratio comhinlng/maximum- 
ratlo transmission (MRC/MRT) or zero-forcing reception/zero¬ 
forcing transmission (ZFR/ZFT) heamforming, we derive two cor¬ 
responding closed-form lower hound expressions for the ergodic 
achievable rate of each pair sources. The closed-form expressions 
enable us to design an optimal power allocation (OPA) scheme 
that maximizes the sum spectral efficiency under certain practical 
constraints. As the antenna array size tends to infinity and the 
signal to noise ratios become very large, asymptotically optimal 
power allocation schemes in simple closed-form are derived. The 
capacity lower bounds are verified to be accurate predictors of 
the system performance by simulations, and the proposed OPA 
outperforms equal power allocation (EPA). It is also found that 
in the asymptotic regime, when MRC/MRT is used at the relay 
and the link end-to-end large-scale fading factors among all pairs 
are equal, the optimal power allocated to a user is Inverse to the 
large-scale fading factor of the channel from the user to the relay, 
while OPA approaches EPA when ZER/ZET is adopted. 

I. Introduction 

Massive multiple-input multiple-output (MIMO) transmis¬ 
sion, in which a base station is equipped with hundreds of 
antennas for multiuser operation, is considered as one of 
the key enabling technologies for 5G [1]. In [2], it was 
first proposed for multi-cell noncooperative scenarios. Such 
large antenna arrays can substantially reduce the effects of 
noise, small-scale fading and inter-user interference, using 
only simple signal processing techniques with reduced total 
transmit power, and only inter-cell interference caused by 
pilot contamination remains [2,3]. Subsequently, the energy 
and spectral efficiency of very large multiuser MIMO systems 
were investigated in the single cell scenarios in [4], which 
showed that the power radiated by the terminals could be 
made inversely proportional to the square-root of the number 
of base station antennas with no reduction in performance 
when considering imperfect channel state information (CSI), 
and that the power could be made inversely proportional to 
the number of antennas if perfect CSI were available. 

Currently, massive MIMO combined with cooperative relay¬ 
ing is considered as a strong candidate for the development of 
future energy-efficient networks and has received increasing 
attention [5-10]. In the held of cooperative relaying, two-way 
relaying technique outperforms one-way relaying in terms of 
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spectral efficiency, since it employs the principle of network 
coding at the relay in order to mix the signals received 
simultaneously from two links for subsequent forwarding, and 
then applies the self-interference cancellation (SIC) at each 
user to extract the desired information [II, 12]. For the multi¬ 
pair two-way relaying with massive MIMO, [7] obtained the 
asymptotic spectral and energy efficiencies of the system ana¬ 
lytically with both maximum-ratio combining/maximum-ratio 
transmission (MRC/MRT) and zero-forcing reception/zero¬ 
forcing transmission (ZFR/ZFT) beamforming, supposing that 
the number of relay antennas approaches to inhnity and the 
transmit power of all users is equal. However, only asymptotic 
cases with perfect CSI and perfect SIC were studied and no 
closed-form expression for the ergodic achievable rate with 
hnite number of relay antennas was derived in [7]. In [9], 
the ergodic achievable rates were investigated with perfect 
CSI based MRC/MRT used at the relay, providing a capacity 
lower bound, the derivation of which involved asymptotic 
approximations. Neither [7] nor [9] considers imperfect CSI 
or power allocation (PA) problems. 

In the literature, instantaneous power allocation schemes 
based on instantaneous rate for regular scale MIMO rather than 
massive MIMO were presented for one way or two way AF 
wireless relay systems to improve system performance [IS¬ 
IS]. In massive MIMO systems, ergodic rate is usually used 
in power allocation because the instantaneous rate approaches 
the ergodic rate as the number of antennas tends to inhnity due 
to the law of large numbers, and such PA schemes are more 
practical with lower complexity than instantaneous rate based 
ones. In [10], an ergodic rate based optimal power allocation 
(OPA) scheme was proposed for a multi-pair decode-and- 
forward (DF) one-way relaying with massive arrays. Never¬ 
theless, power allocation has not been addressed in a massive 
MIMO two-way relaying system. Besides, there is no closed- 
form ergodic rate expressions derived for massive MIMO two- 
way relaying with ZFR/ZFT in the literature. 

This paper considers a multi-pair two-way AF relaying sys¬ 
tem where multiple sources exchange information via a relay 
node equipped with large-scale arrays. Assuming imperfect 
CSI estimation, the relay station employs the MRC/MRT and 
ZFR/ZFT beamforming to process the signals, respectively. 
First, utilizing the technique in [16,17], we derive for the 
hrst time two statistical CSI (SCSI) based closed-form lower 
bounds for the ergodic achievable rate in the case of arbitrary 
number of relay antennas (without resorting to asymptotic 
approximations) with MRC/MRT and ZFR/ZFT processing, 
respectively, based on the properties of Wishart and inverse 
Wishart matrices. Having obtained the closed-form expres- 
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sions, we are able to design an OPA scheme that maximizes 
the sum spectral efficiency under certain practical constraints. 
The proposed OPA scheme is based on geometric program¬ 
ming (GP) [13,18], which can be solved by conventional 
optimization tools, such as CVX [19]. Considering the massive 
MIMO properties, an asymptotically OPA is presented for the 
asymptotic regimes with closed-form solutions. The derived 
closed-form expressions for the achievable rate are verified to 
be accurate predictors of the system performance by Monte- 
Carlo simulations. Furthermore, in order to demonstrate the 
effectiveness of the developed OPA schemes, simulations 
of spectral efficiency are conducted under different system 
configurations, respectively, in comparison to the equal power 
allocation (EPA) schemes. 

The rest of the paper is organized as follows. We briefly 
describe the system model for the multi-pair two-way AF 
relaying in Section II. In Section III, two closed-form expres¬ 
sions for the achievable rate are derived for MRC/MRT and 
ZFR/ZFT, respectively, followed by asymptotic analysis. Then, 
an OPA and an asymptotically OPA are proposed by solving 
the sum-rate maximization based optimization problem in Sec¬ 
tion IV. Furthermore, simulation results under different system 
configurations are given in Section V to demonstrate the 
effectiveness of both derived rate expressions and developed 
OPAs. Finally, we draw our conclusions in Section VI. 

Notations: For a matrix X, we use Tr{X}, X^, X^ and X* 
to denote the trace, the transpose, the Hermitian transpose, and 
the conjugate, respectively. The symbol ||x|| indicates the 2- 
norm of vector x and dzapjx} denotes a diagonal matrix with 
X being its diagonal entries. Moreover, I^r denotes the N x N 
identity matrix, E[-] and Var[-] denotes the expectation and 
the variance operators, respectively, [a]^ denotes max{a, 0}. 
Finally, x ^ CAf{0, D^;) represents a circularly symmetric 
complex Gaussian vector x with zero mean and covariance 
matrix 

II. System Model 

Fig. 1 shows the considered multi-pair two-way AF relaying 
network, where K pairs of users communicate with the help 
of a common relay station by sharing the same time-frequency 
resources. In this system, two single-antenna users in the 
Zth user pair denoted by {21 — 1, 21) or {21,21 — 1) (for 
Z = 1, • • • , K) want to exchange information with each other 
via the relay equipped with N (N ^ 2K ^ 1) antennas. 
Notably, the direct links between the corresponding users 
are assumed non-existing in the two-way relaying system. 
Typically, a two-way network is divided into two phases, 
namely the multiple-access (MA) phase and the broadcast 
(BC) phase [11]. In the MA phase, information is sent from 
the user pairs to the relay; while in the BC phase, the relay 
broadcasts the processed information. 

Fet Pi (i = 1,2, ••• ,2K) and Pr denote the power 
transmitted by user i and the relay corresponding to the 
MA and BC phases, respectively. We assume that all the 
channels between the users and the relay follow independent 
and identically distributed (i.i.d.) Rayleigh fading and time 
division duplex (TDD) is adopted in all transceivers. Thus, 


First phase 



Fig. 1: System diagram of multi-pair two-way AF relaying. 

supposing that gi € (t = 1, 2, • • • , 2A') is the channel 

between the ith user and the relay, gi contains the i.i.d. 
CAf{0,a-f) elements, where af represents the corresponding 
large-scale fading coefficient. In this way, we can denote the 
channel matrix between all the users and the relay accounting 
for both small-scale fading and large-scale fading by 

g = hdi/ 2 = [g,,g2,...,g2^] (1) 

where H G ([^nx 2 K j j (j CAf{0, 1) small-scale 

fading coefficients, and D is the large-scale fading diagonal 
matrix with the Zth diagonal elements denoted by cr? {i = 
1,2,... ,2K). 


A. Channel Estimation 

Practically, the channel matrices in both the MA and BC 
phases have to be estimated for relay processing. However, due 
to the large-scale antenna array at the relay, channel estimation 
at the user side becomes rather impractical. Thus, time division 
duplex (TDD) is adopted here and channel reciprocity can be 
utilized, i.e., only channel matrix G between all the users and 
the relay has to be estimated based on the uplink training. The 
relay then has the estimated CSIs of all uplink and downlink 
channels. The required channel related information at the user 
side can be calculated by the relay and fed back to the users, 
as will be explained later. At the beginning of each coherence 
interval T, all users simultaneously transmit pilot sequences 
of length T symbols. The pilot sequences of all the 2K users 
are pairwisely orthogonal, i.e., t > 2K is required. Then the 
training matrix received at the relay is 

Yr = -y/rppG^ + Nr (2) 

where pp is the transmit power of each pilot symbol. Nr is 
the additive white Gaussian noise (AWGN) matrix with i.i.d. 
components following CJ\f{Q,a^), the training vector trans¬ 
mitted by the ith (i = 1, • • • , 2K) user is denoted by the ith 
row of G satisfying = I 2 R-. Moreover, since 

the rows of pilot sequence matrices are pairwisely orthogonal, 
we have = 0 (V* ^ j G {!,..., 2A"}). 

In order to estimate the channel matrices G, we employ the 
minimum mean-square-error (MMSE) estimation at the relay. 
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The MMSE channel estimates are given by [20] 

G = —^ GD + —^NrD (3) 

where we define D = ^ ^-rpp" and Nr = Nr$^. 

According to the property of we conclude that Nr is 
composed of i.i.d. CAf{0,a^) elements. Then, 

G = G + H = [gi,g2,...,g2ic]eC'^^2if (4) 


where H = [■^i,C 21 • ■ •;C 2 Jf] denotes the estimation error 
matrix which is independent of G from the property of 
MMSE channel estimation [20]. Hence, we have G ~ 
0,Dj with D = diag\^af,( 


0, D — D ) with D — D = 


with i = 1, 




and H 

diag 


■ ■ • ’ 

j 

rpp cr^ 

and ct|. 




rpptT^+CT^ 


,2K. 


B. Data Transmission 


Since the relay station estimates all the channels, it employs 
linear processing MRC/MRT and ZER/ZET based on the 
imperfect CSI. While each user only has the knowledge of its 
pairwise effective channel coefficient for data detection and 
self-interference cancellation coefficient for SIC, which are 
calculated and sent out by the relay. In the MA phase, all the 
users transmit their signals simultaneously to the relay. That 
is, the received signal at the relay station is given by 

2K 

r = X! + rir = Gx -f nj. (5) 

where G = GP, P = diag { ^/pl, ... y/P 2 K} with each 

power satisfying 0 < pi < x = [a:i, X 2 ,..., X 2 k]^ with 
the ith element Xi representing the transmitted signal by the 
ith user and E [xx^] = I 2 R-, r € and nj. is the additive 

white Gaussian noise (AWGN) vector at the relay with zero 
mean and the variance of cr^. 

Then, in the BC phase, the relay multiplies the received 
signal by a linear receiving and precoding matrix to yield the 
relay transmitted signal given by f = Pr G where F 

is the combined beamforming matrix at the relay and its ex¬ 
pression will be given in the next subsection. The transmitted 
signal satisfies the expected transmit power constraint at the 
relay [21], i.e.. 


Pr = E 


= Tr' 


{e[f( 


GG^+a^Ijv 


)F«]}. 


(6) 


2K 

with a total power constraint Pi Pr < In the BC 

2 = 1 


^Here, P and Pq are two constants preset for the total power constraint 
and individual power constraint, respectively. 


phase, the received signal at the k'ih user can be expressed as 
Vk' = gfc'f + nk' = v^gfc'FgfcXfc + y/^gl,Fgk'Xk' 


2K 

gfF 


desired signal 

y/]hg^x^ + gfc/Frii. 


self-interference 


Uk' 


noise from relay noise at user 


inter-pair interference 


(7) 


where (fc, k') is defined to indicate the [A:/2]th^ user pair, and 
Uk' represents the AWGN noise at the k'ih user side with zero 
mean and variance of cr^. 

Using the estimated CSI, the relay calculates and sends out 
the SIC coefficient g^Fgfc (/c G {1, • • • , 2iC}) for each user. 
Hence, the received signal at the A:'th user after SIC is rewritten 
as 


Vk' = v^gfc'Fgfcajfc + ^/^XyXk' 


desired signal 
2K 

i^k,k' 


residual self-interference 

Ti 


rik' 


( 8 ) 


inter-pair interference 


noise from relay noise at user 


where the residual self-interference involves Xk' = 
g|]Fgfc/ - g^,Fgfc/, since the SIC coefficient gl,Fgk> for user 
k' is obtained from the estimated CSI. Here, we suppose that 
there is no error during the SIC coefficients transmission from 
the relay. 


C. MRC/MRT Processing 

In this subsection, the simple and widely used MRC/MRT 
beamforming is adopted. According to [22], the imperfect CSI 
based MRC/MRT beamforming is given by 

F = aiG*TG^ (9) 


where T = dzap {Ti, T 2 ,... Tr-} is the block diagonal 
permutation matrix indicating the user pairing format with 
Ti = T 2 = • • • = Tr- = [0 1; 1 0], and ai is a normalization 
constant, chosen to satisfy the power constraint at the relay 
station in (6). 

By substituting (9) into (6), we have 


(g) Pr 




2l 



2'| 

E 

G*TG^Gx 


+ E 

G*TG^nr 





Pr 



r ^ ~\ 

\ 

N{N + 1) 

2 (vk + (t 2 ) $ + (W + 1) E 

i=l 


( 10 ) 


where 


K 

^ = = ^2r-l^2iAi =P2»-1<t|*-1 + P2rdli 

2=1 

K 

^ = XI (P2i-icr|,_i +P2i(jli). 

2=1 


( 11 ) 


^It is clear that [fc/2] = [fc'/2]. 
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The detailed derivation of the equation is given in Appendix 
A. 


D. ZFR/ZFT Processing 

When employing ZFR/ZFT with imperfect CSI, in which 
the pseudo-inverse of the estimated channels in (4) are needed 
for processing, the linear beamforming is given by [22] 

F = asG* ^t(g^g) ^G^ = a 26 *T 6 ^ 

-1 

where G = g(g^g) and 012 is the normalization 
constant, chosen to satisfy the transmit power constraints at 
the relay. Notably, SIC is not necessary as ZFR/ZFT leads to 
g^/Fgfc/ = 0 (Vfc' € {1, • • • , 2K}). On the basis of (12) and 
Tr{AB} = Tr{BA}, we have 


O) 
Q!2 — 


\ 


Pk 



-V * - H ^ 

2‘ 


/s * ^ H 

2~ 

E 

G TG Gx 


-bE 

G TG n. 



(b) 


Pr 


(13) 


\.4 


2K 

1 


E iN-2K-l)Fi + ^ E 


2K 

where ?} = E (N- 2 K){N- 2 K- 3 )a'^Fp 

of (13) is given in Appendix B. 


. The detailed derivation 


III. Achievable Rate Analysis 

In this section, a general form of the ergodic achievable rate 
of the transmission link k ^ k' for MRC/MRT processing is 
given first, followed by a rate expression for ZFR/ZFT. In 
order to obtain a basic and insightful expression that can be 
used for power allocation optimization, a simplified capacity 
lower bound is derived utilizing the technique of [16,17], in 
which the received signal is rewritten as a known mean times 
the desired symbol, plus an uncorrelated effective noise. The 
worst-case uncorrelated effective noise, where each additive 
term is treated as independent Gaussian noise of the same 
variance, is employed to derive a lower bound. 

From (8), the ergodic achievable rate of the transmission 
link fc —>■ fc' is expressed as (14). 

Remark 1: Here, the ergodic achievable rate is valid based 
on the assumption that the receiving user k' knows perfectly 
g^,Fgfc in the detection process. To demonstrate the accuracy 
of the derived lower bounds, we compare the lower bounds 
with Monte-Carlo realized (14) in Section V. The normaliza¬ 
tion constant for F in (14) is assumed to be calculated based 
on instantaneous CSI by satisfying Pr = ||f||^. 

Further derivation of (14) is difficult because of the in¬ 
tractability to carry out the ensemble average analytically. 
Instead, we adopt the technique in [16] to derive a worst- 
case lower bound of the achievable rate. The first step is to 
rewrite v^Sfc'Fgfc^fe in (8) as the sum of [gfc/Fgfc] Xk 

and (gfc'Fgfc - E [g^,Fgfc]) Xfc, where the first part is 


now considered as the “desired signal”. That is, (8) can be 
expressed as 

Vk' = s/Pi^ [gfc'Fg/c] Xk + fik' (15) 

'---' . 

desired signal effective noise 


where fik' is considered as the effective noise and given by 


fik' =s/Pk (gfe'Fgfc - E [gfc/Fgfc]) Xk + ^y^Xk'Xk' 

2K 

+ gl>F V^^SiX^ + gl>Fn, + Uk'■ 

i^k,k' 


(16) 


It is straightforward to show that the first term “desired 
signal” and the second term “effective noise” in (15) are 
uncorrelated. The exact pdf of nk> is not easy to obtain, but 
we know that the worst-case is to approximate the effective 
noise as independently Gaussian distributed [16]. Since the 
relay is equipped with large-scale antenna arrays by assuming 
N ^ 2K ^ 1, the central limit theorem provides a tight 
statistical CSI based lower bound for the achievable rate. Then, 
the statistical CSI based achievable rate lower bound of the 
transmission link k ^ k' can be obtained as 
= 

W (l + _ Pk\F[gl,Fsk]\" _\ 

PfcVar [glFgk] + SW + IP^. + + NUfe. J 

(17) 


where SR/, IPfc/, NR^/ and NUfe/ denote the residual self¬ 
interference after SIC, the inter-pair interference, the amplified 
noise from relay and the noise at user, respectively, i.e.. 


SR/ =pfc/E 
2K 


gfc/Fgfc/ - gfc/Fgfe/ 
2 


iPfe/ = y] P.E [|g^,Fg 

i^k,k' 


NRfc/ =E 


[|gJ,Fn,|"] , NUfc/ =E[|nfc/ 


(18a) 

(18b) 

(18c) 


When MRC/MRT beamforming is employed, further math¬ 
ematical derivation of (17) leads to the following theorem: 

Theorem 1: With imperfect CSI based MRC/MRT, the 
ergodic achievable rate of the transmission link k 
k', for a finite number of antennas at the relay, is 
lower bounded by (19), where Ofc/ = N {N + 1) , 

= (^ +1) = 
2l>cr?cr2 + (iV -f 1) afal] , cfc/ = 2 [{N + 1) {al,- 

4'^ = [(^ + 1) ^k^k' 


2<5-fe/) o-feCT^/ + {crt - 2al,) 4> 

+2alM, and 4/^ = 

Proof: See Appendix C. 


For imperfect CSI based ZFR/ZFT processing, a closed- 
form expression for the achievable rate in (17) is derived as 
follows: 


Theorem 2: With imperfect CSI based ZFR/ZFT beamform¬ 
ing, the achievable rate of the transmission link k k', for 
a finite number of antennas at the relay, is lower bounded 

by (20), where ew = = 1, + 
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= E 


■ / 

= r , Pk\gl,FSk 

/ V 

--82 - T 2 2K 

Pk'l^k'l + E Pi 1 
\ i^k,k' 

slFg,\ 

^ + gf'F E 2 -t-cr2^ 


(14) 


MRC/MRT : = log 


^k'Pk 


^ I Jr 

y Zj ^) p* + Ck'Pk' + {d[} + d [}^ 


ZFR/ZFT: 7 fpi = log. 


^k'Pk 


-- 

2K 

^ E (/fe'i + fiZ^R^) P* + ^k'Pk' + (ni^^ + 


(19) 


( 20 ) 


_^ j_ „-2 ^2 ' ^(2) _ : 

{N-2K-l)a'f, + Jk',i — 

4 - (1) cr^ I 2 2 -' j 4 - 

mfe' = = (N-2K-l)ai +^n^£.y^^ ’^fc' = ’^nV- 

Proof: See Appendix D. 

Theorems 1 and 2 are also valid for conventional MIMO 
systems, while the bounds become less tight as the antenna 
scale goes down. The capacity lower bounds for perfect CSI 
can always be obtained by setting cr|^ = 0 and of. = cr^ (fc £ 
{1, • • • , 2A"}) in (19) and (20). Moreover, it can be observed 
from (19) that when the estimation error is severe, the residual 
SI occupies the major part of the imperfect CSI effect in 
comparison to other terms. On the other hand, if channel 
estimation is rather accurate, the residual SI has slight effects 
in comparison to other terms. While for ZFR/ZFT, both the 
residual SI and inter-pair interference are determined by the 
channel estimation accuracy. 


fN-2K-l)a^., 

^2.^2 - A'2) 




)■ 


A. Asymptotic Analysis with Massive Arrays 

Based on the derived closed-form expressions for the 
achievable rate in (19) and (20), this section provides the 
asymptotic analysis under two different cases when the number 
of relay antennas approaches to infinity. Suppose that all users 
have the same transmit power, i.e., Pi = P 2 • • • = P 2 K = P’s- 

Proposition 1: In case I where pp is fixed, pi = P 5 = 
^ (i = 1,2, • • • , 2 K), Pr = and Es and Er are fixed, 
to achieve non-vanishing user rate as N ^ 00 , the user and 
relay transmit power scaling factor p and 6 must satisfy 0 < 
p < 1 and 0 < 0 < 1. When p = 1 and 0=1, the asymptotic 
achievable rate expressions of the transmission link k ^ k' 
for imperfect CSI based MRC/MRT and ZFR/ZFT are (21) 
and (22), respectively, which show that the transmit powers at 
both users and relay sides can be scaled down by up to ^ to 
maintain a given rate in case I. When p < 1 and 0 < 1, the 
asymptotic achievable rate of each user approaches to infinity 
as N ^ 00 . 

In case II where pp = pi = Ps = ^ (i = 
1,2,-•• ,2K), Pr = and Eg and Pr are fixed, to 
achieve non-vanishing user rate as N ^ 00 , the pilot, user and 
relay transmit power scaling factors <^, p and 0 must satisfy 
0<ij<l, 0<p< 1 —<; and 0 < 0 < 1 —<t. When 0 < <r < 1, 


p = 1 — and 0 = 1 — <j, the asymptotic achievable rate of the 
transmission link k ^ k' for imperfect CSI based MRC/MRT 
and ZFR/ZFT are (23) and (24), respectively, from which we 
conclude that the transmit powers of each user and the relay 
can only be reduced by up to when the pilot transmit 

power is set as pp = in order to maintain a given spectral 
efficiency. Similarly, when p < \ — (, and 0 < 1 — the 
asymptotic achievable rate of each user approaches to infinity 
as N ^ 00 . 

Remark 2: When the pilot power scaling factor <? = 1, which 
means that the pilot power scales down by to guarantee 
user rate there is p = 0 = 0, which means the relay and user 
transmit power must stay constant and do not scale down with 
N. The achievable rate in this case can be derived from (19) 
and (20), but not shown here due to space limitation. It is 
found that channel estimation error induced interference and 
inter-pair interference cannot be eliminated when pp is scaled 
down proportionally to ^ with fixed pi and Pr in case II. 

It can be observed from Theorems 1 and 2 that for fixed 
(Ti (z = 1, • • • , 2K), an, and pp, the achievable rate of each 
pair-wise user transmission link depends on the user power, 
i.e., the values of p^ (i = I,-- - ,2K), and the relay power. 
Next we propose the optimal power allocation for the studied 
system. 


IV. Power Allocation Schemes 

In this section, a power allocation problem is first formu¬ 
lated and solved for multi-pair users in the MA phase trans¬ 
mission and the relay in the BC phase, which maximizes the 
sum spectral efficiency^. The achievable rate of a transmission 
link 7 fc for k £ {1, • ‘ ’ i 2K} used in the optimization refers 
to the SCSI based achievable rate, given in (19) and (20). 
Power allocation can be performed at the relay side according 
to the SCSI, and then the relay notifies the user pairs their 
allocated power values. Moreover, closed-form asymptotic 
power allocation solutions are presented for MRC/MRT and 

^The objective of power allocation can also be minimizing the total power 
consumption or maximizing the minimum achievable rate, which can be 
formulated and solved using the similar method. 
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/ 


MRC/MRT ; ^ 

N—¥oo 


log 2 


1 + 


EsEjialal, 


2K 

\ Esal j2 + ERcrlalaf, + 

^ 2=1 / 


/ 


ZFR/ZFT : ^ log^ 

iV-»oo 


/ 


EnEso-l 


\ 


1 ^ 2 if 

y ErCTu + CTfcCr2 ^|f + j y 


MRC/MRT : ^ log 2 

Af—>00 


V 


rEpaf, 


T^El,EsERalal, 


\ 


( 


ZFR/ZFT ; 7 ^,^®^ iug 2 


log 2 


1 + 


2K 

Es E crfaf, + Epa^al 

i^l 

t^E^EsEr(t^ 


- 2 -S E 

J =1 


'7 


\ 


N—^oo 

^ rEpEpaf^ + a 


. 4_4 / rEpEp, , <T% ^ 


( 21 ) 


( 22 ) 


(23) 


(24) 


ZFR/ZFT, respectively, for the asymptotic regimes with high 
SNR and N ^ 00 . 


A. Optimal Power Allocation (OPA) 


Most power optimization in communications aims to maxi¬ 
mize the sum spectral efficiency, which is defined as the sum- 
rate (in bits) per channel use. Assuming that T is the length of 
the coherent interval (in symbols), in which t symbols are used 
for channel estimation, the sum spectral efficiency"^ denoted 

. 2K 2K 

as S = Elk = '^~r~^ log 2 fl (^ + Xk), where 

k—1 k—1 


Xk = 


^kPk' 


- is the signal- 

to-interference-plus-noise ratio (SINR). Then, the power allo¬ 
cation problem to maximize the sum spectral efficiency can 
be formulated as 


'T' 2 

max - - -log 2 n (1 + '^25a) 

Pi,PR 1 

k—\ 

2K 

s.t. + Pr < P (25b) 

i=l 

0 < Pk < Po, 0 < Pr < Pr.o, k = 1, - ■ ■ , 2K (25c) 


where P in constraint (25b) is the total power allocated to 
all the users and relay, and constraints (25c) specify the 
peak power limits Pg and Pr g for each user k and relay, 
respectively. The objective in (25a) can be equivalently rewrit- 

'2K I 


ten as min 

Pi,PR 


n (1 

U=i 


Xk) 


, as log 2 (a;) is a monotonic 


increasing function of x. We can see that the constraints are 
posynomial functions. If the objective function is a monomial 
or posynomial function, the problem (25) becomes a GP 


'*Here, the loss due to relay sending out pairwise effective channel coeffi¬ 
cients and SIC coefficients is taken into account. The transmission is assumed 
to be perfect and the overhead is 2 (symbols) for the pairwise effective channel 
coefficient and SIC coefficient per user. 


which can be reformulated as a convex problem, and thus, 
can be solved efficiently by convex optimization tools, such as 
CVX [19]. Flowever, the rewritten objective function for (25) 
is still neither a monomial nor posynomial, making solving the 
problem directly by the convex optimization tools impossible. 
To solve this problem, an approximation for the objective 
function can be efficiently found by using the technique in 
[23]. Specifically, according to [23, Lemma 1], we can use 
a monomial function Hkxj^ to approximate (1 + Xfc) 
an arbitrary point Xk > 0 , where ijk = Xfc(l + Xfc)~^ and 
p'k = Xk'^'° (1 + Xfc)- Consequently, the objective function can 

2K 2K 

be approximated as n (1 + Xfc) ~ n t^fcXfc^ which is a 

fc=l k=l 

monomial function. In this way, the problem is transformed 
into a GP problem by the approximation. 

Similar to [23], a successive approximation algorithm for 
the power allocation problem in (25) is proposed as Algorithm 
1. (26b) is the relaxed constraint. Notably, the parameter /3 here 
is utilized to control the desired approximation accuracy. The 
accuracy is high when /3 is close to 1, but the convergence 
rate is low, and vice versa. As shown in [23], (3 = 1.1 is 
an option that introduces a good accuracy trade off in most 
practical cases. 


B. Asymptotically Optimal Power Allocation (AOPA) 

Obviously, the optimal power allocation scheme in Algo¬ 
rithm 1 is an iterative numerical solution with no closed- 
form. Flowever, more tractable expressions can be found for 
MRC/MRT and ZFR/ZFT, respectively, when we consider the 
asymptotic regimes with high SNR and N ^ oo. 

1) AOPA for MRC/MRT: Suppose that the SNRs at both the 
relay and user sides are very high, i.e., Pr ^ pp ^ cr^ 
and Pi cr^ (i = 1, • • • , 2K), and N K ^ 00 . Then 
the lower bound can be simplified as given by the following 
Lemma. 
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Algorithm 1: Successive approximation algorithm for power allocation 

Initialization: Given tolerance e > 0, the maximum number of iterations L, and parameter /? > 1. Set m = 1. 
Select the initial values Xk,i for Xfc (^ = 2, • • • , 2K). 

Repeat: 

1) Compute T]k,m ~ Xfc,m(f “b Xk,m) and ~ Xk,m 

2) Solve the GP: 


min 

Pi,PR,Xk 

s.t. 


2K 

n 


'flu „ 


-1 


CLkPk' 


2K 

S ^ ) Pi + CkPk + 

i—1 ^ 


')■ 


(‘ 


,(i) 




') 


< Xfc 


(25b), (25c), /3 ^Xk,Rx < Xfc < /^Xfe.r 


(26a) 

(26b) 

(26c) 


3) Set TO = TO + 1, and update Xfc.m = Xfc> where xX — f i 2, • • 
p* (i = 1,2, • • • , 2K) and P* of the GP; 


, 2K) are obtained based on the solutions 


Until: Stop if max 


Xk,m-Xk,: 


< e or TO = L; 


Output: Output p* (i = 1,2, ■ ■ ■ , 2K) and P^ as the solutions. 


Lemma 1: When Pr cr^, pp cr^, pi cr^ (i = 
1, • • • , 2K), and N ^ K ^ oo, the rate of the transmission 
link k ^ k' can be approximated as 




l0g2 


1 + 




k^k'Pk 


2K 




(27) 


i^k' 


Proof: Firstly, we have bf « of for i = 1, • • • , 2K due 

to pp ^ Then, we divide both the denominator and 

numerator of the SINR in (19) by (iV+1). Each item with 

in the denominator is able to be ignored based on N ^ oo. 
2 

Then, according to ^ ~ 0 and pi ^ cr^ for i = 1, - ■ ■ , 2K, 
(27) can be obtained. 

In order to obtain a closed-form solution for asymptotically 
optimal power allocation, we set the fixed link condition 
crfa'^, = C for i = 1, • • • , 2K. Since the approximated rate 
expressions involve no Pr, we do not need to find the optimal 
solution for Pr. For a fixed Pr, (25) can be rewritten as 


max 

Pi 


n 


/ 

1+ 

V 


\ 

Nal,pk' 

2K 

2 E (^IPi 

i^k / 


2K 

s.t. X! Pi < P - Pr 
2 = 1 


0 < Pfc, fc = 1,2, - • • ,2K. 


(28a) 


(28b) 

(28c) 


where the peak power constraints for each user are ignored for 
analysis simplicity by assuming that the channel large-scale 
factors are on the same order of magnitude. Then, we have 
the following Theorem 3 with regard to the optimal allocated 
power for each user. 


Theorem 3: The optimal solution to (28) is obtained as 


Pi = 


P-Pr 

2K 

fc=i 


(29) 


Proof: Since 


2K 


2K 


2K-1 


E ^fpi - w E (^fPi 


■ 0 [24] in 


i^k' i—1 

probability when K ^ oo, according to the inequality of 
arithmetic and geometric means, the objective in (28a) is upper 
bounded by 


2K 


2K {2K - 1) X] ^iPi + NKal,pk> 

n- 

/c=l 


2K 

{2 K-l)j:afp. 

i=l 


< 


, 2K \ 

' 2K {2K - 1) X) ^"iPi. + NKcrl,pk' 

E— 


2K 


2K 




(^2K ^ 2{2K-1) 


2iT(27T-l)^a2p, / 

i=l ' 

N 


2K 


(30) 


where the equality is achieved if and only if alpi = A for 
Vi G {1, • • • , 2K}. To maximize the sum rate, it is obvious 
that the total user power should reach the largest value P — 

2K 

Pr in (28b). Thus (29) can be obtained to satisfy Pi — 

i—1 

P — Pr, indicating that the optimal allocated power for each 
user is inverse to its corresponding large-scale fading factor 
of the channel from the user to the relay, and proportional to 
the channel from the relay to its destination when MRC/MRT 
beamforming is used at the relay under the condition that the 
link end-to-end large-scale fading factors among all pairs are 


























equal. Furthermore, the asymptotic sum-rate is independent of 
the allocated power. 

2 ) AOPA for ZFR/ZFT: Similarly, we make the same as¬ 
sumption for ZFR/ZFT that the SNRs at both the relay and user 
sides are very high and N K ^ oo. Then the following 
Lemma can be obtained. 

Lemma 2: When Pr » cr^, pp » cr^, pi > (/ = 

1, • • • , 2K), and N ^ K ^ oo, the rate of the transmission 
link k ^ k' can be approximated as 






(31) 


Therefore, without any assumptions on link conditions, (25) 
can be rewritten as 


2K 


max 

Pi 


^ ^ {N - 2K - 1) al,pk' 


fe=i 

s.t. (286), (28c) 


(32a) 

(32b) 


Theorem 4 states the optimal allocated power for each user. 
Theorem 4: The optimal solution to (32) is given by 

_2 n + 


Pi = 


[N-2K-l)aU 


(33) 


where A = 2K j {N- 2 K-i)a‘^ 'fj chosen to satisfy 

(28b). 

Proof: Obviously, we can obtain the solution to (32) by the 
Lagrange multiplier approach associated with Karush-Kuhn- 
Tucker (KKT) conditions. 

From (33), it can be concluded that the allocated powers 
are equal for ZFR/ZFT when N — 2K oo. 


V. Numerical Results 

Simulations are conducted to validate the derived achievable 
rate expressions and examine the performance of the designed 
power allocation schemes, respectively. In the simulation 
study, we set the length of the coherent interval T = 200 
(symbols), the number of user pairs K = 10, the training 
length T = 2K, and the noise variance is normalized to be 
cr^ = 1. Furthermore, SNR = Pr is defined at the relay side. 



Fig. 2: Spectral efficiency versus SNR for lower bounds and 
Monte-Carlo results (pp = 10 dB, ERA, Pr = 2KPs). 



Fig. 3: Spectral efficiency versus K for lower bounds and 
Monte-Carlo results (pp = 10 dB, EPA, 

SNR = Pr = 2KPs, N = 128). 


A. Validation of Achievable Rate Results 

Eirstly, the effectiveness of the derived SCSI based achiev¬ 
able rate in (19) and (20) is evaluated by comparing the 
spectral efficiency with the Monte-Carlo simulation results. 
Eor simplicity, we assume that the large-scale fading factors 
are af = 1 for alH = 1, 2, • • • , 2K and equal power allocation 
for users is utilized with the total power 2KPs = Pr. In Eig. 2 
with pp = 10 dB, the spectral efficiency curves versus Ps 
obtained from the analytical lower bounds (19) and (20), are 
compared with the ones given by the exact capacity expression 
(14) obtained through Monte-Carlo simulation. It is evident 
that the relative performance gap between the capacity lower 
bound (20) and the exact capacity (14) for ZFR/ZFT is even 
smaller than that for MRC/MRT, especially with larger number 
of antennas. Moreover, Fig. 2 shows that the spectral efficiency 


of ZFR/ZFT increases much faster than that of MRC/MRT 
as SNR increases. It is due to the fact that the effect of 
interference is much larger than that of the noise for higher 
SNR while ZFR/ZFT is able to null multi-user interference 
signals [21]. Hence, the effectiveness of the derived closed- 
form lower bounds for both MRC/MRT and ZFR/ZFT has 
been demonstrated. 

Fig. 3 compares the spectral efficiency of MRC/MRT 
with that of ZFR/ZFT at different SNRs, which shows that 
ZFR/ZFT does not always outperform MRC/MRT in the mas¬ 
sive MIMO two-way relaying systems. As K increases, i.e., 
the average SNR (|^) of each user decreases, the noise effect 
exceeds the interference effect causing wore performance of 
ZFR/ZFT. Fig. 4 depicts the performance of ZFR/ZFT over 
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Fig. 4: Spectral efficiency versus K for lower bounds and 
Monte-Carlo results {pp = 10 dB, EPA, 

SNR = Pr = 2KPs = 0 dB). 

MRC/MRT with different N/K ratios. Different from the 
results obtained by Fig. 3, Fig. 4 shows that under a fixed 
N/K, the gains brought by ZFT/ZFR grow even though the 
average SNR (|^) of each user decreases as the number of 
user pairs K increases. These observations indicate that two 
asymptotic regimes, large-scale antenna arrays and high SNR, 
are equivalent [4]. Moreover, larger gains over MRC/MRT are 
achieved by the ZFR/ZFT processing at higher N/K. 

Next the asymptotic analyses with massive arrays for the 
two cases in Propositions 1 and 2 are examined, supposing 
Pr = 2K * Ps and EPA employed with the transmit power at 
each user satisfying pi = Ps (i = 1,2, - ■ ■ , 2K). Fig. 5 shows 
the required user transmit power Ps to achieve 1 bit/s/Hz per 
user. It is obvious from Fig. 5(a) that in case I where the 
pilot power pp is fixed, the required user transmit power is 
significantly reduced as N increases, and that the required Ps 
with ZFR/ZFT is lower than that with MRC/MRT. Regarding 
the imperfect CSI effect, less user transmit power is required 
when Pp is high. On the other hand, when pp is low and 
N is small, the required 1 bit/s/Hz achievable rate per user 
cannot be achieved even with infinite Ps, which means that 
the only way to reduce the imperfect CSI effect and thus 
achieve required spectral efficiency is to increase the number 
of antennas at the relay. For case II with Ep = 10 and the pilot 
power scaling down by pp = Fig. 5(b) shows that higher 
leads to more slowly reduced Ps, because the imperfect CSI 
effect becomes much severer when the pilot power is reduced 
faster with the increase of N. 

B. Power Allocation 

In this subsection, the proposed power allocation schemes 
in Section IV are examined in regard to the performance of 
the spectral efficiency. 

In OPA, simulations are performed assuming that Pq = 
10 dB, Pr o = 23 dB and P = 23 dB. First, we choose 



Fig. 5: Required user power to achieve 1 bit/s/Hz per user 
for MRC/MRT and ZFR/ZFT (EPA, Pr = 2KPs). 

the large-scale fading matrix as follows 

D =diag{Q.lAd 0.045 0.246 0.121 0.125 0.142 0.635 
0.256 0.021 0.123 0.257 0.856 1.000 0.899 
0.014 0.759 0.315 0.432 0.195 0.562} 

which is a snapshot of the practical setup, indicating that all 
large-scale fading factors fall into the interval [0.014,1.000]. 
Fig. 6 shows the spectral efficiency versus pp with fixed 
N = 32, 64 and 128 under both OPA and EPA. The employed 
EPA here allocates equal power to each user where the sum 
power consumed by all users achieves its maximum value P/2 
with Pr = P/2, i.e., pi = ^ for all / = 1,2,--- ,2K. 
For the OPA, algorithm 1 is utilized with the initial values 
chosen as follows: e: = 0.01, P = 10, /3 = 1.1, and 
^ ^_ (dz ^ 

1, • • • , 2K) are obtained by the EPA scheme. It can be 
observed from Fig. 6 that OPA outperforms EPA, especially 
when the number of relay antennas is high, which demon¬ 
strates the effectiveness of our proposed PA. Furthermore, 
the spectral efficiency improvement in OPA for MRC/MRT 
beamforming is always smaller than that for ZFR/ZFT under 
different pp. 

For AOPA, Fig. 7 validates the effectiveness of (29) and 
(33) at a large N and a high SNR in comparison to OPA 
and EPA. Notably, the OPA here considers fixed Pr = P/2, 
i.e., only user power allocation is performed. Fig. 7(a) shows 
that when Pr = pp = 20 dB, the obtained spectral efficiency 
in AOPA for MRC/MRT^ is almost the same as that in OPA, 
reaching high gains over EPA. However, if the SNR is reduced 
to Pr = Pp = 0 dB, the gain achieved by AOPA over EPA 
becomes rather slight even when the number of antennas is sig¬ 
nificantly increased. While regarding the ZFR/ZFT, Fig. 7(b) 

^In the AOPA for MRC/MRT, we set = l/cli-i for i = 1, ■ ■ ■ , iC 
according to the assumption in Section IV. 
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Fig. 6: Spectral efficiency versus pp (Pq = 10 dB, 
Pr,o = 23 dB, P = 23 dB). 



Fig. 7: Spectral efficiency versus N for different power 
allocation schemes (Pr = P/2). 


illustrates that when the number of antennas is large and SNR 
is high, AOPA tends to EPA as predicted by (33). Furthermore, 
Fig. 7(b) shows that AOPA outperforms EPA slightly when the 
SNR is reduced to Pr = pp = 0 dB. 

VI. Conclusion 

In this paper, closed-form ergodic achievable rate expres¬ 
sions have been derived for a multi-pair massive MIMO 
two-way AE relaying system with imperfect CSI and linear 
processing. Optimal power allocations schemes based on the 
obtained rate expressions have been shown to outperform 
equal power allocation in various scenarios. It has been found 
that the asymptotically optimal power solutions for MRC/MRT 
and ZER/ZET achieve almost the same performance as OPA 
when the SNR is high and the number of antennas at the relay 


is large. Both AOPA and OPA outperform EPA on the spectral 
efficiency. Besides, the allocated power of each user in AOPA 
is inverse to the large-scale fading factor of the channel from 
the user to the relay, and proportional to the channel from the 
relay to its destination for MRC/MRT under the condition that 
the link end-to-end large-scale fading factors among all pairs 
are equal, and the AOPA for ZER/ZET tends to be EPA when 
N is large. 


Appendix A: Proof of Equation (10) 


To prove 


(10), 

2 


we start from the expectation 

2 

E G*TG^Gx , which is rewritten as (34). Step 

(a) is obtained by E [xx^] = I 2 K and substituting 

G = GP and (4) into the equation, and step (b) results 
from the independence between G and H and the property 
Tr{AB} = Tr{BA}. As for the first term in (34), we have 

A" 

(35), where ’i' = with i/i = +P 2 iO-|i and 

2 = 1 


$ = ^ (/i with (pi = Step (a) in (35) results 

i=l 

from substituting G in (4), P and T into the equation and 
formula expansion based on the fact that the expectation of 

2K 

E E [gfegf (g2i-lg^j + S2jg^j-l) {g2^-lg2^ + g2ig2^-l)] 
k=l 

= 0 for any i ^ j. Step (b) is obtained by 

Tr {AB} = Tr {BA}, some results from Gaussian distributed 
estimated channel in (4)® [25], and [26, Lemma 2.9]. To 
elaborate in detail, for the items in (a) with any fc ^ 2/ — 1 or 
2i, we have (36) and (37), where the properties x*y^ = yx^ 
and x^y* = y^x for arbitrary vectors x and y are utilized. 
While for the items with fc = 2/ — 1 and 2i, we have (38) 
and (39). On account of the independence between G and H, 
the second term in (34) becomes (40). By substituting (35) 
and (40) into (34), we have 


E 


G*TG"Gx 


= N{N + 1) 


K 




2 = 1 


(41) 


K 


where ^ 'fpi with ipi + p 2 icrli. 

2 = 1 

To proceed, we need to calculate the expectation 

"... 2l 

E G*TG^nr in (10), which is elaborated as follows 


E 


G*TG^n, 

G*TG^n, 

K 




= Tr a^E 


G*TG^GT^G^ 


(f>) 




(c) 


iS2i-lS2^ + g2*g^-l) (g2i-lg^* + g2*g^,-l) 


2=1 


= 2N + 


(42) 


^Due to the estimated channel model in (4), we have that and gj 
are mutually independent x 1 vectors with Vi ^ j whose elements are 
i.i.d. zero-mean Gaussian distributed with variances af and respectively. 
Then, it can be concluded that E [g^^gj] = E{g^gj} = 

E [g|^gj] = 0. Also, we can obtain that E [jg^g^p] = Nafo-j. 
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E 


Tr|E 


G*TG"Gx 


(a) 

= Tr<^E 


G*TG^ (g - h) PP^ (g - h) ^GT^G 


= Tr/E 


GPP^G^GT^G^G*TG^ 


E 


hpp^h^gt^g'^g*tg^ 


GPP^G^GT^G^G*TG^ 


}‘=T^ 


. fc=l 


}+!!•{ 

K 

SkikJ2 (g2i-lC + S2zk2i-l) {s*2^-li2i + i*2iS2i-l 


} 


(34) 


i=l 


(b) 


K 


= 2N {N + 1) + N{N + if J2 


2=1 


Tr {E [gfcgf g 2 iC_ig 2 i-ig^] } = Tr {E [gfcgf ] E [g 2 *g^-ig 2 *-ig^] } = 


Tr {E [ikifi 2 i-ii 2 ii 2 i-ii 2 i\ } = Tr {E [gfcgf ] E [g 2 ^-lk 2 ^-l] E [g 2 *g^] } = Nal,_ 


1^2i^k 


(35) 

(36) 

(37) 


Tr {E [g 2 *-ig|^_i (g 2 i-iC + g 2 *g^i-i) (g 2 *-igS + g 2 ig 2 *-i)] } = 27V (AT + 1 ) o-2.o-4._^ + AT (iv + 1)2 (38) 

Tr {E [^ 2 ikf^ {k 2 ^-lk^i + k 2 ik^^-l) (g 2 i-igS + gig 2 *-i)] } = 2Ab (Ab + 1) o-|,_iCT^i + TV (TV + 1)^ (39) 


2K 


Tr|E HPP^H^GT^G^G*TG'f^j I = ^pjCrf.TriE 


T=i 


K 


E (g2^-lg^^ + g2zg^*-l) (g2i-lg^ + g2*g^-l) 


2K 


(40) 


= 2TV(TV + 1)I>^p,4. 

T=i 


where step (a) is obtained by E = <J^i-N 

Tr{AB} = Tr{BA}, (&) results from substituting G in (4), 
P and T into the equation and the fact that the expectation 

of E [(g 2 i-lg|i + g 2 zgi-l) (g 2 j-lg^l + k 2 jklj-l)] = 0 
for any i ^ j, and step (c) results from the properties of 
Tr{AB} = Tr{BA}, x*y^ = yx^ and x'^y* = y^x 
for arbitrary vectors x and y, and the properties of Gaussian 
distributed vectors, respectively, the detailed derivation of 
which is 

Tr {E [(g^i.ig^ + g2*g^-i) {k2i-ikli + g2*g^i-i)] } 
= 2E [g|^g2i-lg^_lg2z] + 2E [g^g2*] E [g^_ig2z-l] 

= 2 N{N + l)al_^al. 

(43) 

Hence, by substituting (41) and (42) into the step (a) in (10), 
the proof of (10) is completed. 


— 




— 1 oD 1 for VT, j 

i.T 


G {1, 2, • • • , 2A}, following an 

inverse Wishart distribution of VV;^(A + 2K + 1,D~^). 
Furthermore, step (c) is based on Tr{AB} = Tr{BA}. As 
to the detailed derivation of (d), we use the identity [27,28] 

E [W-i] = ^ ^ (45) 

where W ~ yVm{n, S) is an m x m central complex Wishart 
matrix with n{n > m) degrees of freedom and the distribution 

of is called an inverted Wishart distribution, following 

A 


(g^g) 


(ri + m ■ 


1, S ^). It can be easily concluded that = 


W^fiN + 2A + 1,D-i) with = (n) 


for VT, j G {1, 2, • • • , 2A}, hence 
E 


* j 




■ / ^ \ -1* 

G G 

= E 

-1 

15 

_1 


D 


A - 2A - 1 


(46) 


To 


E 


Appendix B; Proof of Equation (13) 
prove (13), likewise, we start 




from 


^ * - H ^ 

G TG Gx 

2' 

u- u u //I/IN where cov 

, which can be rewritten as (44), 

‘^k,k’^k',k' 



var[wi.i] ii,j G {1,2, 


2K 


where fj = Y. iN-2K){N-2K-Z)a‘^.a-r 

1 = 1 3 3 

based on the fact that the estimation error matrix H is 
independent of G, and step (6) is obtained by an intuitive 

property of G G = G^G = 12^, the definition of 

Pt = TPP^T = diag{p2,Pi,---P2K,P2K-i} and 

2 K 

E [HPP^H^] = YPi'^lJ-N, which is derived from the 

i—1 

-1 


where N > 2K. In this way, we have E = 

E [wfc.fc] = for fc = 1, 2, • • ■ , 2A, (47) and (48), 

{k,k' G {1,2,--- , 2A}), E [wi.i] and 
• • , 2 A}) are calculated based on the 
properties of the inverse Wishart matrix Cl [27,28]. 


Then, the calculation of E 


- * ± H 

G TG tir 


in (13) can be 


distribution of H. Here, fl = 


(g^g)' 


is defined with 


elaborated as (49). Hence, by substituting (44) and (49) into 
the step (a) in (13), the proof of (13) is completed. 

Appendix C: Proof of Theorem 1 

To derive the closed-form expression of the achievable rate 
in (17), we start from the expectation E [g^,Fgfc] in the 
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E 


n * ± H ^ 

G TG Gx 


= Tr<^ E 


(a) 

= Tr E 


= ^p,.E [if I,] + E (e \-^l. 


+ Tr<i E 

2K 


HPP^H^GTG G TG 


G*TG (g - h) PP^ (g - h) GTG 

2K 

^p,cr|.Tr|E rj*TS7Tj| 


G TG GPP^G^GTG 


= Tr<|E 


G PtG 


2K 2K 


+E 




2=1 


2 = 1 j = l 


= E 


2=1 

2K 


Pi' 


2K 


{N -2K- 1) 5-2 


^Ep*4 


(44) 


numerator based on MRC/MRT in (9), given by (50), where 
step (a) is obtained by g^/ = gk' — ^k' and the independence 
between g^/ and ^k', step (b) results from substituting F in (9), 
G in (4) and T into the equation and the fact E = 0, 

(c) is obtained by formula expansion based on the fact that 
the expectation of E [g^, 


for any * [f] and 


S2i-r§2i 


g2*gf-i)gfc] = 0 
, and step (d) results from the 
property Tr { AB} = Tr {BA} and the properties of Gaussian 
distributed vectors. 

Then, the variance of g^,Fgk in the denominator of (17) is 
(51), where step (a) indicates the definition of the variance and 
(6) results from gi = gi — and the independence between 
gi and (Vi G {1, • • ■ , 2A}). For the first term of step (6) in 
(51), we have (52), where step (6) is obtained simply by the 
property TrjAB} = TrjBAj, the properties of Gaussian 
distributed vectors, and [26, Lemma 2.91. To elaborate in 
detail, for the items of (a) in (52) with i = ^ , we. have (53), 

where the properties of E [gigf gigf] = (A + l)crflN (i = 
1, 2, • • • , 2K) resulting from the fact that vectors g^ contains 


the i.i.d. CJ\f{0,af) elements, E |0| 


= 2 (t| for arbitrary 


complex value 6 ~ CAf{0,(Jg), and [26, Lemma 2.9] are 
utilized, respectively. While for the items with i ^ ^ , we 

have (54), where TrjAB} = TrjBA} and the properties of 
Gaussian distributed vectors are utilized, respectively. Subse¬ 
quently, the left three terms of step ( 6 ) in (51) are calculated 
and expressed as (55). Substituting (52) and (55) into (51) 

K 

leads to (56), where we define 4> = ^ with (pi = 

2=1 

Similarly, we obtain (57). Substituting (50), 
(56) and (57) into (17), we have (58), where 
^k',i = (^ + 1) + 2CTfcr^,l>, 

Ck' = 2 [(A + 1) {al, - 2al,) al&l, + {af, -2al,) $ , 
and step (a) is obtained by substituting (10) into (58). In this 
way, (19) is obtained and thus Theorem 1 is demonstrated. 


Appendix D; Proof of Theorem 2 

In this appendix, we focus on the proof of the closed- 
form expression in (17) with imperfect CSI based ZFR/ZFT 
processing. First, we start from the expectation E [g^,Fgfc] 
in the numerator, given by (59), where step (a) is based on 
the fact that the estimation error matrix H is independent of 
G, i.e., ^i is independent of gj for Vi,j G {1,2,--- ,2A}, 
and step (b) is obtained by g^Fg^ = 0 : 2 % on account of 
G^FG = a2l2K- 


Then, based on the imperfect CSI based ZFR/ZFT process¬ 
ing in (13), the variance of gJ/Fg^ in the denominator of (17) 

is (60), where 0 ,,, = + (^- 2 ^ 1)4 + 44 "? 

with i,j G {1,2,-- - , 2 A}, step (a) results from the property 
TrjAB} = TrjBA}, g^,FgfcgfF«g*, = a 2 ^Sk,kSk'.k' 
and E = a^JN for Vi G jl,2,--- ,2A}, step ( 6 ) 

’ - A / - - 

is obtained by the definition of 12 = (G^Gj with 

(Aij = for G jl,2, ••• ,2A} and the fact of 

glG*{G^G*)-^ = el and (G^G*)-'G^g^, = (c) 

is just an intuitive transformation, and (d) is derived according 
to the properties of the inverse Wishart matrix in (47) and (48). 

Considering that no SIC is performed as glFgk' = 0, the 
self-interference term can be rewritten as 


SIfe' = Pfe'E 


gfe'Fgfc' 


— Pk' 0^2 


{Ok',k' 


4 A 


Similarly, we obtain 


2K 

IPfe' = E NUfe, = al 

i^k,k' 


NRfc. = 


2 2 
0:2 cr^ 


(A - 2A - 1) 0 -; 


2 + a2^alal^,fi. 


( 61 ) 


(62a) 


(62b) 


Substituting (59), (60), (61) and (62) into (17), we have (63), 
where (a) results form substituting (13) into (63). Thus (20) 
is obtained and Theorem 2 is demonstrated. 
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